Xuhong Zhang

Faithful Bi-Directional Model Steering via Distribution Matching and Distributed Interchange Interventions

Feb 05, 2026

Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory

Jan 26, 2026

Large Language Model-Powered Evolutionary Code Optimization on a Phylogenetic Tree

Jan 20, 2026

Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization

Jan 13, 2026

ToolGate: Contract-Grounded and Verified Tool Execution for LLMs

Jan 08, 2026

IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation

Jan 06, 2026

Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging

Nov 13, 2025

RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation

Nov 13, 2025

MEraser: An Effective Fingerprint Erasure Approach for Large Language Models

Jun 14, 2025

VModA: An Effective Framework for Adaptive NSFW Image Moderation

May 29, 2025